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1) GENERAL INFORMATION: 

(i) APPLICANT: Choulika, Andre 
Perr in , Arnaud 
Dujon, Bernard 
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(ii) TITLE OF INVENTION: Nucleotide Sequence Encoding the Enzyme 
I-SCEI and the Uses Thereof 

(iii) NUMBER OF SEQUENCES: 52 

(iv) CORRESPONDENCE ADDRESS: 
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(D) STATE: D.C. 

(E) COUNTRY: USA 
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(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 
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(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: Unknown 
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(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 09,196,131 
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(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/879,689 

(B) FILING DATE: 05-MAY-1992 



(viii) 



ATTORNEY/AGENT INFORMATION : 
(A) NAME: Meyers, Kenneth J. 



(B) REGISTRATION NUMBER: 25,146 

(C) REFERENCE/DOCKET NUMBER: 3495-0111-12 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-408-4000 

(B) TELEFAX: 202-408-4400 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



-0 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ATGCATATGA AAAACATCAA AAAAAACCAG GTAATGAACC TCGGTCCGAA CTCTAAACTG 60 

CTGAAAGAAT ACAAATCCCA GCTGATCGAA CTGAACATCG AACAGTTCGA AGCAGGTATC 12 0 

GGTCTGATCC TGGGTGATGC TTACATCCGT TCTCGTGATG AAGGTAAAAC CTACTGTATG 18 0 

CAGTTCGAGT GGAAAAACAA AGCATACATG GACCACGTAT GTCTGCTGTA CGATCAGTGG 24 0 



^ GTACTGTCCC CGCCGCACAA AAAAGAACGT GTTAACCACC TGGGTAACCT GGTAATCACC 3 00 

f 

M 3 TGGGGCGCCC AGACTTTCAA ACACCAAGCT TTCAACAAAC TGGCTAACCT GTTCATCGTT 3 60 

1 

G AACAACAAAA AAACCATCCC GAACAACCTG GTTGAAAACT ACCTGACCCC GATGTCTCTG 42 0 

GCATACTGGT TCATGGATGA TGGTGGTAAA TGGGATTACA ACAAAAACTC TACCAACAAA 4 80 

TCGATCGTAC TGAACACCCA GTCTTTCACT TTCGAAGAAG TAGAATACCT GGTTAAGGGT 54 0 

CTGCGTAACA AATTCCAACT GAACTGTTAC GTAAAAATCA ACAAAAACAA ACCGATCATC 60 0 

TACATCGATT CTATGTCTTA CCTGATCTTC TACAACCTGA TCAAACCGTA CCTGATCCCG 660 

CAGATGATGT ACAAACTGCC GAACACTATC TCCTCCGAAA CTTTCCTGAA ATAA 714 
(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



55" 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met His Met Lys Asn lie Lys Lys Asn Gin Val Met Asn Leu Gly Pro 
15 10 15 

Asn Ser Lys Leu Leu Lys Glu Tyr Lys Ser Gin Leu lie Glu Leu Asn 
20 25 30 

lie Glu Gin Phe Glu Ala Gly lie Gly Leu lie Leu Gly Asp Ala Tyr 
35 40 45 

lie Arg Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gin Phe Glu Trp 
50 55 60 

Lys Asn Lys Ala Tyr Met Asp His Val Cys Leu Leu Tyr Asp Gin Trp 
65 70 75 80 

Val Leu Ser Pro Pro His Lys Lys Glu Arg Val Asn His Leu Gly Asn 

Leu Val lie Thr Trp Gly Ala Gin Thr Phe Lys His Gin Ala Phe Asn 
1^" 100 105 110 

in 

Lys Leu Ala Asn Leu Phe lie Val Asn Asn Lys Lys Thr lie Pro Asn 
^ 115 120 125 

?! 

P 

P Asn Leu Val Glu Asn Tyr Leu Thr Pro Met Ser Leu Ala Tyr Trp Phe 

f* 130 135 140 



Met Asp Asp Gly Gly Lys Trp Asp Tyr Asn Lys Asn Ser Thr Asn Lys 

145 150 155 160 

Ser lie Val Leu Asn Thr Gin Ser Phe Thr Phe Glu Glu Val Glu Tyr 

165 170 175 

Leu Val Lys Gly Leu Arg Asn Lys Phe Gin Leu Asn Cys Tyr Val Lys 

180 185 190 

lie Asn Lys Asn Lys Pro lie lie Tyr lie Asp Ser Met Ser Tyr Leu 

195 200 205 

lie Phe Tyr Asn Leu lie Lys Pro Tyr Leu lie Pro Gin Met Met Tyr 

210 215 220 

Lys Leu Pro Asn Thr lie Ser Ser Glu Thr Phe Leu Lys 

225 230 235 

(2) INFORMATION FOR SEQ ID NO : 3 : 



ft 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

AAAAATAAAA TCATATGAAA AATATTAAAA AAAATCAAGT AATCAATCTC GGTCCTATTT 60 

CTAAATTATT AAAAGAATAT AAATCACAAT TAATTGAATT AAATATTGAA CAATTTGAAG 12 0 

CAGGTATTGG TTTAATTTTA GGAGATGCTT ATATTCGTAG TCGTGATGAA GGTAAAACTT 180 

ATTGTATGCA ATTTGAGTGG AAAAATAAGG CATACATGGA TCATGTATGT TTATTATATG 24 0 

Q ATCAATGGGT ATTATCACCT CCTCATAAAA AAGAAAGAGT TAATCATTTA GGTAATTTAG 3 00 

IB TAATTACCTG GGGAGCTCAA ACTTTTAAAC ATCAAGCTTT TAATAAATTA GCTAACTTAT 3 60 

\Jl TTATTGTAAA TAATAAAAAA CTTATTCCTA ATAATTTAGT TGAAAATTAT TTAACACCTA 42 0 

M> 

jjl TGAGTCTGGC ATATTGGTTT ATGGATGATG GAGGTAAATG GGATTATAAT AAAAATTCTC 4 80 

TTAATAAAAG TATTGTATTA AATACACAAA GTTTTACTTT TGAAGAAGTA GAATATTTAC 540 

it 

o 

** TTAAAGGTTT AAGAAATAAA TTTCAATTAA ATTGTTATGT TAAAATTAAT AAAAATAAAC 600 

■ M 

ja * CAATTATTTA TATTGATTCT ATGAGTTATC TGATTTTTTA TAATTTAATT AAACCTTATT 66 0 

Q 

TAATTCCTCA AATGATGTAT AAACTGCCTA ATACTATTTC ATCCGAAACT TTTTTAAAAT 72 0 

AA 722 
(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Lys Asn lie Lys Lys Asn Gin Val Met Asn Leu Gly Pro Asn Ser 
15 10 15 



51 
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las? 



Lys Leu Leu Lys Glu Tyr Lys Ser Gin Leu lie Glu Leu Asn lie Glu 
20 25 30 

Gin Phe Glu Ala Gly lie Gly Leu lie Leu Gly Asp Ala Tyr lie Arg 
35 40 45 

Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gin Phe Glu Trp Lys Asn 
50 55 60 

Lys Ala Tyr Met Asp His Val Cys Leu Leu Tyr Asp Gin Trp Val Leu 
65 70 75 80 

Ser Pro Pro His Lys Lys Glu Arg Val Asn His Leu Gly Asn Leu Val 
85 90 95 

lie Thr Trp Gly Ala Gin Thr Phe Lys His Gin Ala Phe Asn Lys Leu 
100 105 110 

Ala Asn Leu Phe lie Val Asn Asn Lys Lys Leu lie Pro Asn Asn Leu 
115 120 125 



Val Glu Asn Tyr Leu Thr Pro Met Ser Leu Ala Tyr Trp Phe Met Asp 
130 135 140 



09 

|jj 

l^j Asp Gly Gly Lys Trp Asp Tyr Asn Lys Asn Ser Leu Asn Lys Ser lie 

i\ 145 150 155 160 



Val Leu Asn Thr Gin Ser Phe Thr Phe Glu Glu Val Cys Tyr Leu Val 
165 170 175 

Lys Gly Leu Arg Asn Lys Phe Gin Leu Asn Cys Tyr Val Lys lie Asn 
180 185 190 

Lys Asn Lys Pro lie lie Tyr lie Asp Ser Met Ser Tyr Leu lie Phe 
195 200 205 

Tyr Asn lie lie Lys Pro Tyr Leu lie Pro Gin Met Met Tyr Lys Leu 
210 215 220 

Pro Asn Thr lie Ser Ser Glu Thr Phe Leu Lys 
225 230 235 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

CCGGATCCAT GCATATGAAA AACATCAAAA AAAACCAGGT AATGAACCTG GGTCCGAACT 60 

CTAAACTGCT GAAAGAATAC AAATCCCAGC TGATCGAACT GAACATCGAA CAGTTCGAAG 12 0 

CAGGTATCGG TCTGATCCTG GGTGATGCTT ACATCCGTTC TCGTGATGAA GGTAAAACCT 18 0 

ACTGTATGCA GTTCGAGTGG AAAAACAAAG CATACATGGA CCACGTATGT CTGCTGTACG 24 0 

ATCAGTGGGT ACTGTCCCCG CCGCACAAAA AACAACGTGT TAACCACCTG GGTAACCTGG 30 0 

TAATCACCTG GGGCGCCCAG ACTTTCAAAC ACCAAGCTTT CAACAAACTG GCTAACCTGT 3 60 

TCATCGTTAA CAACAAAAAA ACCATCCCGA ACAACCTGGT TGAAAACTAC CTGACCCCGA 42 0 

TGTCTCTGGC ATACTGGTTC ATGGATGATG GTGGTAAATG GGATTACAAC AAAAACTCTA 4 80 

CCAACAAATC GATCGTACTG AACACCCAGT CTTTCACTTT CGAAGAAGTA GAATACCTGG 54 0 

Q TTAAGGGTCT GCGTAACAAA TTCCAACTGA ACTGTTACGT AAAAATCAAC AAAAACAAAC 600 

OP CGATCATCTA CATCGATTCT ATGTCTTACC TGATCTTCTA CAACCTGATC AAACCGTACC 660 

aJLs 

(fl TGATCCCGCA GATGATGTAC AAACTGCCGA ACACTATCTC CTCCGAAACT TTCCTGAAAT 72 0 

m 



m AATAAGTCGA CTGCAGGATC CGGTAAGTAA GTAA 754 



(2) INFORMATION FOR SEQ ID NO : 6 : 

13 

£* (i) SEQUENCE CHARACTERISTICS: 

y[ (A) LENGTH: 11 base pairs 

" (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



m 
q 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
AATGCTTTCC A 11 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



# 




TTTCACACAG GAAACAGCTA TGACCATGAT TACGAATTCT CATGTTTGAC AGCTTATCAT 600 

CGATAAGCTT TAATGCGGTA GTTTATCACA GTTAAATTGC TAACGCAGTC AGGCACCGTG 660 

TATGAAATCT AACAATGCGC TCATCGTCAT CCTCGGCACC GTCACCCTGG ATGCTGTAGG 72 0 

CATAGGCTTG GTTATGCCGG TACTGCCGGG CCTCTTGCGG GATATCCGCC TGATGCGTGA 7 80 

ACGTGACGGA CGTAACCACC GCGACATGTG TGTGCTGTTC CGCTGGGCAT GCCAGGACAA 84 0 

CTTCTGGTCC GGTAACGTGC TGAGCCCGGC CAAGCTTACT CCCCATCCCC CTGTTGACAA 90 0 

TTAATCATCG GCTCGTATAA TGTGTGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA 960 

CAGGATCCAT GCATATGAAA AACATCAAAA AAAACCAGGT AATGAACCTG GGTCCGAACT 102 0 

CTAAACTGCT GAAAGAATAC AAATCCCAGC TGATCGAACT GAACATCGAA CAGTTCGAAG 10 80 

CAGGTATCGG TCTGATCCTG GGTGATGCTT ACATCCGTTC TCGTGATGAA GGTAAAACCT 114 0 

Q ACTGTATGCA GTTCGAGTGG AAAAACAAAG CATACATGGA CCACGTATGT CTGCTGTACG 12 0 0 

m ATCAGTGGGT ACTGTCCCCG CCGCACAAAA AAGAACGTGT TAACCACCTG GGTAACCTGG 12 60 

w 

Ijj TAATCACCTG GGGCGCCCAG ACTTTCAAAC ACCAAGCTTT CAACAAACTG GCTAACCTGT 13 2 0 

jjl TCATCGTTAA CAACAAAAAA ACCATCCCGA ACAACCTGGT TGAAAACTAC CTGACCCCGA 13 80 

h 

TGTCTCTGGC ATACTGGTTC ATGGATGATG GTGGTAAATG GGATTACAAC AAAAACTCTA 144 0 

CCAACAAATC GATCGTACTG AACACCCAGT CTTTCACTTT CGAAGAAGTA GAATACCTGG 1500 

TTAAGGGTCT GCGTAACAAA TTCCAACTGA ACTGTTACGT AAAAATCAAC AAAAACAAAC 1560 

CGATCATCTA CATCGATTCT ATGTCTTACC TGATCTTCTA CAACCTGATC AAACCGTACC 162 0 

TCATCCCCCA GATGATGTAC AAACTGCCGA ACACTATCTC CTCCGAAACT TTCCTGAAAT 1680 

AATAAGTCGA CCTGCAGCCC AAGCTTGGCA CTGGCCGTCG TTTTACAACG TCGTGACT 173 8 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



m 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



Met Leu Val Arg • Gly Ala Glu Pro 
1 5 

Leu Phe Thr Val Pro Gly Leu Leu 
20 

Ser Cys Val lie Pro 
35 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino .acids 

(B) TYPE: amino acid 
(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 




Met Glu Lys Arg Gin Gin Arg Gly 
10 15 

Leu Ala Phe Cys Ser His Val Leu 
25 30 



Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

|jj Met Gin Leu Ala Arg Gin Val Ser Arg Leu Glu Ser Gly Gin 

Q 1 5 10 

CP 

jj, (2) INFORMATION FOR SEQ ID NO: 12: 

m 

.fa (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D ) TOPOLOGY : 1 inear 



o 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:12: 

Met Leu Pro Ala Arg Met Leu Cys Gly lie Val Ser Gly 
15 10 

(2) INFORMATION FOR SEQ ID NO : 13 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



Met Thr Met lie Thr Asn Ser His Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Met Lys Ser Asn Asn Ala Leu lie Val lie Leu Gly Thr Val Thr Leu 
15 10 15 

Asp Ala Val Gly lie Gly Leu Val Met Pro Val Leu Pro Gly Leu Leu 
© 20 25 30 

130 Arg Asp lie Arg Leu Met Arg Glu Arg Asp Gly Arg Asn His Arg Asp 

W 35 40 45 

m 
w - 

Met Cys Val Leu Phe Arg Trp Ala Cys Gin Asp Asn Phe Trp Ser Gly 
m 50 55 60 



3! 

o 



Asn Val Leu Ser Pro Ala Lys Leu Thr Pro His Pro Pro Val Asp Asn 
65 70 75 80 



(2) INFORMATION FOR SEQ ID NO : 15 : 



00 

im 

]** (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Met Cys Gly lie Val Ser Gly 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 



(ii) 



(D) TOPOLOGY: linear 
MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met His Met Lys Asn lie Lys Lys Asn Gin Val Met Asn Leu Gly Pro 
15 10 15 

Asn Ser Lys Leu Leu Lys Glu Tyr Lys Ser Gin Leu lie Glu Leu Asn 
20 25 30 

lie Glu Gin Phe Glu Ala Gly lie Gly Leu lie Leu Gly Asp Ala Tyr 
35 40 45 

lie Arg Ser Arg Asp Glu Gly Lys Thr Tyr Cys Met Gin Phe Glu Trp 
50 55 60 

Lys Asn Lys Ala Tyr Met Asp His Val Cys Leu Leu Tyr Asp Gin Trp 
65 70 75 80 

Val Leu Ser Pro Pro His Lys Lys Glu Arg Val Asn His Leu Gly Asn 
85 90 95 

Leu Val lie Thr Trp Gly Ala Gin Thr Phe Lys His Gin Ala Phe Asn 
100 105 110 

Lys Leu Ala Asn Leu Phe lie Val Asn Asn Lys Lys Thr lie Pro Asn 
115 120 125 

Asn Leu Val Glu Asn Tyr Leu Thr Pro Met Ser Leu Ala Tyr Trp Phe 
130 135 140 

Met Asp Asp Gly Gly Lys Trp Asp Tyr Asn Lys Asn Ser Thr Asn Lys 
145 150 155 160 

Ser lie Val Leu Asn Thr Gin Ser Phe Thr Phe Glu Glu Val Glu Tyr 
165 170 175 

Leu Val Lys Gly Leu Arg Asn Lys Phe Gin Leu Asn Cys Tyr Val Lys 
180 185 190 

lie Asn Lys Asn Lys Pro lie lie Tyr lie Asp Ser Met Ser Tyr Leu 
195 200 205 

lie Phe Tyr Asn Leu lie Lys Pro Tyr Leu lie Pro Gin Met Met Tyr 
210 215 220 

Lys Leu Pro Asn Thr lie Ser Ser Glu Thr Phe Leu Lys 
225 230 235 



• 



(2) INFORMATION FOR SEQ ID NO : 17 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

CGCTAGGGAT AACAGGGTAA TATAGC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 base pairs 
p (B) TYPE: nucleic acid 

\Q (C) STRANDEDNESS: single 

jjg (D) TOPOLOGY: linear 

i . 3 

p (ii) MOLECULE TYPE: DNA (genomic) 

t 

U 3 
*0 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
GCGATCCCTA TTGTCCCATT ATATCG 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
TTCTCATGAT TAGCTCTAAT CCATGG 
(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 



• 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 
AAGAGTACTA ATCGAGATTA GGTACC 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

o 

y (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21 

i 

jj^ CTTTGGTCAT CCAGAAGTAT ATATTT 

111 

/pi (2) INFORMATION FOR SEQ ID NO: 22: 

!U (i) SEQUENCE CHARACTERISTICS: 

*p (A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS :, single 
^ (D) TOPOLOGY: linear 

P 

H (ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 
GAAACCAGTA GGTCTTCATA TATAAA 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 
TAACGGTCCT AAGGTAGCGA AATTCA 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY.: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 



a 

DO 

m 
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ATTGCCAGGA TTCCATCGCT TTAAGT 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
TGACTCTCTT AAGGTAGCCA AATGCC 



(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 
ACTGAGAGAA TTCCATCGGT TTACGG 



# 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

CGAGGTTTTG GTAACTATTT ATTACC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 base pairs 
p (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



w 

j™ (ii) MOLECULE TYPE: DNA (genomic) 



m 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 

? g CCTCCAAAAC CATTGATAAA TAATGG 

8-JL 

n (2) INFORMATION FOR SEQ ID NO: 29: 

m 

p (i) SEQUENCE CHARACTERISTICS: 

^ (A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29 
GGGTTCAAAA CGTCGTGAGA CAGTTT 
(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CCCAAGTTTT GCAGCACTCT GTCAAA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
GATGCTGTAG GCATAGGCTT GGTTAT 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO 
CTACGACATC CGTATCCGAA CCAATA 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 
CTTTCCGCAA CAGTATAATT TTATAA 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
GAAAGGCGTT GTCATATTAA AATATT 

o 

(2) INFORMATION FOR SEQ ID NO: 35: 

ro 
w 
m 

I— 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TO POLOG Y : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35 



2 

ID 

ACCATGGGGT CAAATGTCTT TCTGGG 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 
TGGTACCCCA GTTTACAGAA AGACCC 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 

GTGCCTGAAT GATATTTATT ACCTTT 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 base pairs 
p (B) TYPE: nucleic acid 

% Q (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
GTGCCTGAAT GATATTTATT ACCTTT 
(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 
H= (A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 
CAACGCTCAG TAGATGTTTT CTTGGGTCTA CCGTTTAAT 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 
GTTGCGAGTC ATCTACAAAA GAACCCAGAT GGCAAATTA 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
CAAGCTTATG AGTATGAAGT GAACACGTTA TT 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 
GTTCGAATAC TCATACTTCA CTTGTGCAAT AA 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 
GCTATTCGTT TTTATGTATC TTTTGCGTGT AGCTTTAA 3 8 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
CGATAAGCAA AAATACATAG AAAACGCACA TGGAAATT 3 8 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 



O 

m 

hi 

lire? 

jji 

m (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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M CCAAGCTCGA ATTCGCATGC TCTAGAGCTC GGTACCCGGG ATCCTGCAGT CGACGCTAGG 60 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:45: 



GATAACAGGG TAATACAGAT 80 
(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
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GGTTCGAGCT TAAGCGTACG AGATCTCGAG CCATGGGCCC TAGGACGTCA GCTGCGATCC 60 
CTATTGTCCC ATTATGTCTA 8 0 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
ATCAGATCTA AGCTTGCATG CCTGCAGGTC GACTCTAGAG GATCCCCGGG TACCGAGCTC 60 
GAATTCACTG GCCGTCGTTT 80 



jXj (2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 80 base pairs 
Jspj (B) TYPE: nucleic acid 

;k (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
TAGTCTAGAT TCGAACGTAC GGACGTCCAG CTGAGATCTC CTAGGGGCCC ATGGCTCGAG 60 
CTTAAGTGAC CGGCAGCAAA 80 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 




TACAACGTCG TGACTGGGAA AACCCTGGCG TTACCCAACT TAATCGCCTT GCAGCACATC 60 
CCCCTTTCGC CAGCTGGCGT 80 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
ATGTTGCAGC ACTGACCCTT TTGGGACCGC AATGGGTTGA ATTAGCGGAA CGTCGTGTAG 
GGGGAAAGCG GTCGACCGCA 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
TAGGGATAAC AGGGTAAT 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 
ATCCCTATTG TCCCATTA 
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